Abstract

Let $\{|\psi_1\rangle, \ldots, |\psi_n\rangle; p_1, \ldots, p_n\}$ be an ensemble of pure quantum states. We show that it
is possible to increase all of the pairwise overlaps $|\langle\psi_i|\psi_j\rangle|$, i.e. make each constituent pair of
the states more parallel (while keeping the prior probabilities the same), in such a way that
the von Neumann entropy $S$ is increased, and dually, make all pairs more orthogonal while
decreasing $S$. We show that this phenomenon cannot occur for ensembles in two dimensions
but that it is a feature of almost all ensembles of three states in three dimensions. It is known
that the von Neumann entropy characterises the classical and quantum information capacities
of the ensemble and we argue that information capacity, in turn, is a manifestation of
the distinguishability of the signal states. Hence our result shows that the notion of
distinguishability within an ensemble is a global property that cannot be reduced to considering
distinguishability of each constituent pair of states.
1 Introduction

The interpretation and physical significance of non-orthogonality is one of the fundamental
enigmas in the foundations of quantum theory. Let $|\phi\rangle$ and $|\chi\rangle$ be two non-orthogonal states of a
quantum system. We may decompose $|\phi\rangle$ as a superposition of components parallel and
perpendicular to $|\chi\rangle$:

$$|\phi\rangle = a\,|\chi\rangle + b\,|\chi^\perp\rangle \tag{1}$$

where $\langle\chi^\perp|\chi\rangle = 0$ and $a = \langle\chi|\phi\rangle$. Since any time evolution in quantum mechanics is unitary
(when we include also the state of any ambient environment), $|\chi\rangle$ and $|\chi^\perp\rangle$ will evolve as though
independent, remaining orthogonal, and the decomposition in (1) is preserved. Thus we see
that the overlap $|\langle\chi|\phi\rangle|$ measures the extent to which the state $|\phi\rangle$ behaves as though it were
actually equal to the state $|\chi\rangle$. This view is further formalised in the many worlds interpretation
of quantum theory according to which (1) may be thought of as a "splitting" into two "worlds".
In one of these worlds the state $|\phi\rangle$ is indeed actually precisely equal to $|\chi\rangle$.

The overlap (by which we will always mean the absolute value of the inner product) is also a
fundamental ingredient in the question of (non-)distinguishability of quantum states. In standard
quantum measurement theory $|\langle\chi|\phi\rangle|^2$ is the probability that $|\phi\rangle$ passes the test of "being the
state $|\chi\rangle$". Although $|\phi\rangle$ and $|\chi\rangle$ are distinct states in the mathematical formalism of quantum
theory, there is no physical process that can distinguish them with certainty and indeed the overlap
provides a quantitative measure of the extent to which the states cannot be distinguished (as, for
example, in the Peres measurement [?] for optimal distinguishability of non-orthogonal states).
This is in contrast to distinct states in classical physics which are always perfectly distinguishable
in principle.

1 Present address: Rohde & Schwarz GmbH & Co. KG, Test and Measurement Division, P.O. Box 80 14 69,
D-81614 Munich; Mühldorfstr. 15, D-81671 Munich, Federal Republic of Germany.
The purpose of this paper is to describe a situation which appears to contradict the above
intuitions. We will describe a situation in which quantum states actually become more
distinguishable (in a certain natural sense) when they are made more parallel, i.e. when their overlap
increases. Our notion of distinguishability will be based on information-theoretic considerations
and will rest on the concept of von Neumann entropy. Recent work in quantum information
theory [?] has shown that this alternative quantification of distinguishability is very natural
and compelling. Indeed if we view quantum states as carriers of information then their capacity
for embodying information is a very natural measure of distinguishability, i.e. a set of states can
communicate more information if and only if the states are made more "distinguishable" (all
else, such as prior probabilities, remaining the same). Quantum states may be used to carry two
different kinds of information, classical and quantum information, and we will first briefly outline
the essential results characterising the respective information capacities which form the basis of
our quantification of distinguishability in terms of von Neumann entropy.

Consider first the case of quantum information. Suppose that Alice has a source which emits
an unending sequence of qubit signal states $|\psi_1\rangle = |0\rangle$ and $|\psi_2\rangle = \frac{1}{\sqrt{2}}(|0\rangle + |1\rangle)$. Each emission is
chosen to be $|\psi_1\rangle$ or $|\psi_2\rangle$ with an equal prior probability of a half. Let
$\rho = \frac{1}{2}|\psi_1\rangle\langle\psi_1| + \frac{1}{2}|\psi_2\rangle\langle\psi_2|$
be the density matrix of the source and let $S = S(\rho) = -\mathrm{Tr}\,\rho\log_2\rho$ be its von Neumann entropy.
Alice wishes to communicate the sequence of states to Bob. Clearly this may be achieved by
transmitting one qubit per emitted state but according to the quantum source coding theorem
[?] she can communicate the quantum information (with arbitrarily high fidelity) using
(asymptotically) only $S$ qubits per state. Furthermore this compression is optimal: no smaller
number of qubits per signal can achieve this task. In our example a direct calculation gives
$S = 0.601$ qubits per signal.
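
The quoted value can be checked numerically. The following is a minimal Python sketch, not part of the original text; the function name `two_state_entropy` is an illustrative choice. It relies only on the fact that a two-state density matrix has unit trace and determinant $p_1 p_2 (1 - |\langle\psi_1|\psi_2\rangle|^2)$, so its two non-zero eigenvalues follow from a quadratic equation.

```python
import math

def two_state_entropy(overlap, p1=0.5, base=2.0):
    """Von Neumann entropy of rho = p1|psi1><psi1| + p2|psi2><psi2|.

    `overlap` is |<psi1|psi2>|. Since Tr(rho) = 1 and
    det(rho) = p1*p2*(1 - overlap^2) on the span of the two states,
    the eigenvalues are (1 +/- sqrt(1 - 4*det)) / 2.
    """
    p2 = 1.0 - p1
    root = math.sqrt(max(0.0, 1.0 - 4.0 * p1 * p2 * (1.0 - overlap ** 2)))
    eigenvalues = [(1.0 + root) / 2.0, (1.0 - root) / 2.0]
    return -sum(x * math.log(x, base) for x in eigenvalues if x > 0.0)

# Source from the text: |psi1> = |0>, |psi2> = (|0> + |1>)/sqrt(2),
# so the overlap is 1/sqrt(2) and both prior probabilities are 1/2.
S = two_state_entropy(1.0 / math.sqrt(2.0))
print(round(S, 3))  # 0.601
```

Orthogonal states (overlap 0) give 1 qubit per signal and identical states (overlap 1) give 0, which are the two classical extremes.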

Now consider the analogous situation in classical physics: if we have two classical signals
with equal prior probabilities of a half then no compression beyond 1 bit per signal is possible (by
Shannon's source coding theorem [0]) and one may ask what is the origin of the extra non-classical
compression in the quantum case? Clearly it is related to the overlap: if $\theta = \cos^{-1}|\langle\psi_1|\psi_2\rangle|$ is
the angle between the two signal states then $S$ depends only on $\theta$ and increases monotonically
from 0 to 1 as $\theta$ increases from 0 to $\pi/2$ (corresponding to the classical situation). In view of (1)
and the discussion following it, one is tempted to think of $|\langle\psi_1|\psi_2\rangle|$ as representing a redundancy
or overlap of quantum information between $|\psi_1\rangle$ and $|\psi_2\rangle$ which may be "compressed out", i.e. to
some extent $|\psi_1\rangle$ and $|\psi_2\rangle$ are the "same" and this common quantum information in every signal,
already known to Bob, need not be sent.

More generally if we have an ensemble of signal states $\{|\psi_1\rangle, \ldots, |\psi_n\rangle\}$ with prior probabilities
$p_1, \ldots, p_n$ then the quantum source coding theorem asserts that the quantum information may be
compressed to $S$ qubits per signal where $S$ is the von Neumann entropy of $\rho = \sum_i p_i |\psi_i\rangle\langle\psi_i|$ and
that this compression is optimal. Note that the von Neumann entropy $S(\rho)$ is always less than
or equal to the Shannon entropy $H(p_1, \ldots, p_n)$ [?] and we might think of the extra quantum
compression to $S$ qubits beyond the classical limit of $H$ bits per signal as being due to the overlap
of the quantum information represented by the constituent states as expressed in (1). Our results
below will imply that this interpretation is incorrect. Hence the origin of the extra quantum
compression is evidently more subtle.

Let $\{|\phi_1\rangle, \ldots, |\phi_n\rangle; p_1, \ldots, p_n\}$ denote the ensemble of quantum states $|\phi_i\rangle$ taken with prior
probabilities $p_i$ respectively. Let

$$\mathcal{E} = \{|\psi_1\rangle, \ldots, |\psi_n\rangle; p_1, \ldots, p_n\} \quad\text{and}\quad \tilde{\mathcal{E}} = \{|\tilde\psi_1\rangle, \ldots, |\tilde\psi_n\rangle; p_1, \ldots, p_n\}$$

be two ensembles with the same number of states and with the same corresponding prior
probabilities. Let

$$\rho = \sum_i p_i |\psi_i\rangle\langle\psi_i| \qquad \tilde\rho = \sum_i p_i |\tilde\psi_i\rangle\langle\tilde\psi_i|$$

be the respective density matrices and let $S$ and $\tilde S$ be the von Neumann entropies. We will show
that it is possible to have the following situation: the states of $\tilde{\mathcal{E}}$ are all pairwise more parallel
(i.e. have greater overlap) than the corresponding states of $\mathcal{E}$ yet the von Neumann entropy of $\tilde{\mathcal{E}}$
is greater than that of $\mathcal{E}$, i.e. we simultaneously have

(ENS1): $|\langle\psi_i|\psi_j\rangle| \le |\langle\tilde\psi_i|\tilde\psi_j\rangle|$ for all $i, j$

(ENS2): $S < \tilde S$

This is in contradiction to our intuitive discussion above. Each pair of states of $\tilde{\mathcal{E}}$ has a greater
overlap of quantum information than the corresponding states of $\mathcal{E}$ in the sense of (1) yet as a
totality they embody more quantum information.

We will say that an ensemble $\mathcal{E}_2$ is a deformation of $\mathcal{E}_1$ if they have the same number of states
and the two ensembles have the same list of prior probabilities (in the given order), i.e. the states
of $\mathcal{E}_2$ are being thought of as obtained by "deforming" the corresponding states of $\mathcal{E}_1$ while keeping
the probabilities fixed.

We will show that (ENS1) with (ENS2) can never be satisfied for any pair of ensembles $\mathcal{E}$ and
$\tilde{\mathcal{E}}$ in two dimensions (regardless of the number of states) but that for almost all ensembles $\mathcal{E}$ of
three states in three dimensions, there is an ensemble $\tilde{\mathcal{E}}$ satisfying (ENS1) and (ENS2).

The phenomenon in (ENS1) and (ENS2) shows, curiously, that information capacity (or
distinguishability) is a global property of a set of states and not an accumulative local property of
pairs of constituent states. Indeed for ensembles $\{|\psi_1\rangle, |\psi_2\rangle; p_1, p_2\}$ of just two states, the von
Neumann entropy is a monotonically decreasing function of the overlap $|\langle\psi_1|\psi_2\rangle|$. Thus the
overlap conditions in (ENS1) imply that each constituent pair of signals has a diminished capacity for
information as we pass from $\mathcal{E}$ to $\tilde{\mathcal{E}}$ yet (ENS2) states that the ensemble as a whole develops an
increased capacity.

Von Neumann entropy also characterises the classical information capacity of an ensemble of
quantum states. Suppose that Alice is constrained to use the states $|\psi_i\rangle$ with prior probabilities $p_i$
and she wishes to communicate classical information to Bob. On receiving a string of states, Bob
is allowed to perform any joint measurement on a signal block of any length in order to maximise
his acquired mutual information about the identity of the states. Then it may be shown [?] that
the von Neumann entropy $S$ of the signal ensemble gives the maximum amount of information per
signal that Alice is able to reliably transmit to Bob under the above constraints. Now classical
information capacity is very closely related to the concept of distinguishability, which, by any
definition, is itself a form of classical information about the identity of the states. Then (ENS1)
with (ENS2) shows that the members of an ensemble of quantum states can become pairwise less
distinguishable yet as a whole the ensemble becomes more distinguishable, i.e. has more classical
information capacity.

2 The Gram Matrix Formulation 

We now describe a formalism for studying the conditions represented by (ENS1) and (ENS2) and 
give a method for generating realisations in dimension d > 2. 

Consider an ensemble $\{|\psi_1\rangle, \ldots, |\psi_n\rangle; p_1, \ldots, p_n\}$ of $n$ states in $d$ dimensions. The density
matrix is

$$\rho = \sum_{i=1}^{n} p_i |\psi_i\rangle\langle\psi_i| \tag{2}$$

We introduce the Gram matrix $G$ defined as the $n \times n$ matrix of re-scaled inner products:

$$G_{ij} = \sqrt{p_i p_j}\,\langle\psi_i|\psi_j\rangle \tag{3}$$
The Gram matrix enjoys the following two fundamental properties: 

(G1): The non-zero eigenvalues of $G$ are the same as the non-zero eigenvalues of $\rho$ (and their
respective multiplicities are also the same). Note that in general $d \ne n$ so the mismatch in
the numbers of eigenvalues is made up by zero eigenvalues. It follows that $\rho$ and $G$ also
have the same von Neumann entropy.

To see this, introduce $n$ orthonormal vectors $|e_i\rangle$ in an auxiliary Hilbert space and consider
the pure state

$$|\Phi\rangle = \sum_{i=1}^{n} \sqrt{p_i}\,|\psi_i\rangle|e_i\rangle \tag{4}$$

Then $\rho$ and $G$ are just the two reduced states obtained by partial trace of $|\Phi\rangle\langle\Phi|$ over
the second and first components respectively. Hence they must have the same non-zero
eigenvalues (c.f. appendix of [?]). □
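
Property (G1) can be illustrated in the smallest case $n = d = 2$, where no padding by zero eigenvalues is needed. The sketch below is an added illustration in plain Python; the two qubit states, the probabilities and all function names are arbitrary choices made for the example, not taken from the text.

```python
import cmath
import math

def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def eig2(m):
    """Eigenvalues of a 2x2 Hermitian matrix, smallest first."""
    tr = (m[0][0] + m[1][1]).real
    det = (m[0][0] * m[1][1] - m[0][1] * m[1][0]).real
    root = math.sqrt(max(0.0, tr * tr / 4.0 - det))
    return [tr / 2.0 - root, tr / 2.0 + root]

def density_matrix(states, probs):
    """rho = sum_i p_i |psi_i><psi_i| as in eq. (2), for qubit states."""
    return [[sum(p * s[a] * s[b].conjugate() for s, p in zip(states, probs))
             for b in range(2)] for a in range(2)]

def gram_matrix(states, probs):
    """G_ij = sqrt(p_i p_j) <psi_i|psi_j> as in eq. (3)."""
    n = len(states)
    return [[math.sqrt(probs[i] * probs[j]) * inner(states[i], states[j])
             for j in range(n)] for i in range(n)]

# Two arbitrary (hypothetical) qubit states with unequal prior probabilities.
psi1 = [1.0 + 0.0j, 0.0 + 0.0j]
psi2 = [complex(math.cos(0.4)), cmath.exp(0.9j) * math.sin(0.4)]
probs = [0.3, 0.7]

rho_eigs = eig2(density_matrix([psi1, psi2], probs))
gram_eigs = eig2(gram_matrix([psi1, psi2], probs))
print(rho_eigs)
print(gram_eigs)  # agrees with rho_eigs up to rounding
```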

(G2): $G$ is always a positive matrix and $\mathrm{Tr}\,G = 1$. Conversely, if $A$ is any $m \times m$ positive matrix
with $\mathrm{Tr}\,A = 1$ then $A$ is the Gram matrix of an ensemble of $m$ states in $m$ dimensions [?].

The first part follows immediately from eq. (4) where $G$ is identified as a density matrix
itself. For the converse statement note that if $A$ is positive we can write $A = B^2$ where
$B$ is Hermitian so $A = B^\dagger B$. Let $b_i$ be the normalised $i$th column of $B$ and let $t_i$ be its
squared length. Then $A = B^\dagger B$ expresses precisely the fact that $A$ is the Gram matrix of
the ensemble of $m$-dimensional states $\{b_1, \ldots, b_m; t_1, \ldots, t_m\}$. The probabilities $t_i$ are just
the diagonal entries of $A$. □

The Gram matrix, expressed in terms of the inner products rather than the states themselves,
provides a natural vehicle for studying the conditions (ENS1) and (ENS2). Indeed we are generally
not interested in the actual positions of the ensemble states but only in their relative positions,
i.e. inner products. The following lemma shows that the Gram matrix encodes this information
while eliminating the superfluous data of overall unitary repositionings:

Lemma 1: Two ensembles

$$\mathcal{E}_1 = \{|\alpha_1\rangle, \ldots, |\alpha_m\rangle; p_1, \ldots, p_m\} \text{ on } \mathcal{H}_1 \quad\text{and}\quad \mathcal{E}_2 = \{|\beta_1\rangle, \ldots, |\beta_n\rangle; q_1, \ldots, q_n\} \text{ on } \mathcal{H}_2$$

have equal Gram matrices $G_1 = G_2$ if and only if $m = n$, $p_i = q_i$ for $i = 1, \ldots, m$ and there is a
unitary transformation $U$ on $\mathcal{H}_1 \oplus \mathcal{H}_2$ with $|\beta_i\rangle = U|\alpha_i\rangle$ for all $i$. □

We give the proof in the appendix.

Let $G$ and $\tilde G$ be respectively the Gram matrices of the ensembles $\mathcal{E}$ and $\tilde{\mathcal{E}}$, which have the
same prior state probabilities (i.e. $G$ and $\tilde G$ have the same diagonals). Then (ENS1) is equivalent
to $|G_{ij}| \le |\tilde G_{ij}|$ for each $i$ and $j$, i.e.

$$G_{ij} = r_{ij}\,\tilde G_{ij}, \qquad r_{ij} \in \mathbb{C} \text{ and } |r_{ij}| \le 1 \text{ for each } i, j \tag{5}$$

The matrix $r = [r_{ij}]$ satisfies the following properties:

(R1) $r$ is Hermitian (without loss of generality)

(R2) the diagonal entries $r_{ii}$ are all equal to 1 and $|r_{ij}| \le 1$ for all $i$ and $j$.

However for given $\tilde G$, $r$ cannot be chosen arbitrarily subject to (R1) and (R2) because $G$ is
required to be a positive matrix (by (G2)). Also we wish to choose $r$ so that eq. (5) induces a
decrease in the entropy of the Gram matrix $\tilde G$.

The componentwise product of $\tilde G$ and $r$ in (5) is known as the Hadamard product of matrices.
We denote it by

$$G = r * \tilde G \tag{6}$$

to distinguish it from the usual matrix product. The Schur product theorem [?] asserts that the
Hadamard product of any positive matrices is again a positive matrix. Hence if $r$ is chosen to
be positive then by (G2) $G$ will again correspond to an ensemble of states. However in this case
(ENS2) can never be satisfied, i.e. the entropy is non-decreasing. To see this, let $\tilde G$ be the Gram
matrix of an ensemble $\tilde{\mathcal{E}}$ comprising $n$ states $|\tilde\psi_i\rangle$ with probabilities $p_i$. If $r$ satisfying (R1) and
(R2) is positive then (G2) implies that $\frac{1}{n} r$ is also a Gram matrix of some collection of states,
$|\xi_1\rangle, \ldots, |\xi_n\rangle$ say, taken with equal prior probabilities $\frac{1}{n}$. Thus

$$r_{ij} = \langle\xi_i|\xi_j\rangle$$

Hence (6) asserts that $G$ is the Gram matrix of the ensemble $\tilde{\mathcal{E}}(\xi)$ comprising the states $|\tilde\psi_i\rangle \otimes |\xi_i\rangle$
with probabilities $p_i$. Thus $\tilde{\mathcal{E}}(\xi)$ is an "extension" of $\tilde{\mathcal{E}}$ obtained by simply adjoining the states
$|\xi_i\rangle$ to the corresponding $|\tilde\psi_i\rangle$'s. As such, the entropy $S(\xi)$ of $\tilde{\mathcal{E}}(\xi)$ can never be smaller than the
entropy $\tilde S$ of $\tilde{\mathcal{E}}$. We give three brief proofs of this fact, each invoking a different (substantial)
theorem. Firstly if $S(\xi) < \tilde S$ then Alice could reliably communicate the quantum information
of $\tilde{\mathcal{E}}$ to Bob using less than $\tilde S$ qubits per signal. She simply adjoins the states $|\xi_i\rangle$, compresses
to $S(\xi)$ qubits per signal and on reception and decompression, Bob just discards the extensions.
This contradicts the quantum noiseless coding theorem [?]. Secondly, in a similar way,
$S(\xi) < \tilde S$ contradicts the classical information capacity theorem [?] (which asserts that the von
Neumann entropy is the classical information capacity): the extended ensemble $\tilde{\mathcal{E}}(\xi)$ cannot have
a smaller information capacity since Alice and Bob can always just ignore the presence of the
extensions for the purposes of classical communication. Thirdly, the passage from $|\tilde\psi_i\rangle \otimes |\xi_i\rangle$ to
$|\tilde\psi_i\rangle$ (i.e. discarding the second state) is a physically realisable operation and hence a CP map.
Then Uhlmann's monotonicity theorem [?] (asserting that relative entropy can never increase
under any CP map) immediately implies that the entropy of $\tilde{\mathcal{E}}$ cannot exceed the entropy of $\tilde{\mathcal{E}}(\xi)$
(since the entropy of any ensemble of pure states $\rho_i = |\phi_i\rangle\langle\phi_i|$ is just the average relative entropy
$\sum_i p_i S(\rho\|\rho_i)$ where $\rho = \sum_i p_i \rho_i$ is the ensemble density matrix).

Hence we have: 

Lemma 2: If we wish to satisfy (ENS2) together with (ENS1) it is necessary that the matrix $r$
of multipliers in eq. (5) have at least one negative eigenvalue. □

An example described later (having G positive) will show that this necessary condition is 
unfortunately not also sufficient. 
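
The argument leading to Lemma 2 can be illustrated numerically. The sketch below is an added illustration with hypothetical choices throughout: the three-state ensemble, the angles defining the positive multiplier matrix $r$, and the helper names are not from the text. It forms the Hadamard product of a $3 \times 3$ Gram matrix with a positive $r$ satisfying (R1) and (R2), and confirms that every overlap shrinks while the entropy does not decrease. Eigenvalues are obtained with the standard trigonometric formula for $3 \times 3$ Hermitian matrices.

```python
import math

def eig3(m):
    """Eigenvalues of a 3x3 Hermitian matrix via the trigonometric formula."""
    q = (m[0][0] + m[1][1] + m[2][2]).real / 3.0
    shifted = [[m[i][j] - (q if i == j else 0.0) for j in range(3)]
               for i in range(3)]
    p = math.sqrt(sum(abs(x) ** 2 for row in shifted for x in row) / 6.0)
    if p < 1e-15:
        return [q, q, q]
    b = [[x / p for x in row] for row in shifted]
    det_b = (b[0][0] * (b[1][1] * b[2][2] - b[1][2] * b[2][1])
             - b[0][1] * (b[1][0] * b[2][2] - b[1][2] * b[2][0])
             + b[0][2] * (b[1][0] * b[2][1] - b[1][1] * b[2][0])).real
    phi = math.acos(max(-1.0, min(1.0, det_b / 2.0))) / 3.0
    return [q + 2.0 * p * math.cos(phi + 2.0 * math.pi * k / 3.0)
            for k in range(3)]

def entropy(eigs):
    """Von Neumann entropy in nats from a list of eigenvalues."""
    return -sum(x * math.log(x) for x in eigs if x > 1e-15)

# Illustrative ensemble: |0>, (|0>+|1>)/sqrt(2), (|0>+|1>+|2>)/sqrt(3),
# each with probability 1/3 (all inner products are real here).
a12, a23, a31 = 1 / math.sqrt(2), math.sqrt(2.0 / 3.0), 1 / math.sqrt(3)
p = 1.0 / 3.0
G_tilde = [[p, p * a12, p * a31],
           [p * a12, p, p * a23],
           [p * a31, p * a23, p]]

# Positive multiplier matrix: r_ij = <xi_i|xi_j> for real unit vectors
# xi_i = (cos t_i, sin t_i); the angles t_i are arbitrary.
angles = [0.0, 0.3, 0.6]
r = [[math.cos(angles[i] - angles[j]) for j in range(3)] for i in range(3)]

# Hadamard (componentwise) product, eq. (6).
G = [[r[i][j] * G_tilde[i][j] for j in range(3)] for i in range(3)]

S_tilde = entropy(eig3(G_tilde))
S_new = entropy(eig3(G))
print(S_tilde, S_new)  # S_new is not smaller than S_tilde
```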

3 Ensembles in two dimensions 

We begin by proving: 

Lemma 3: (ENS1) and (ENS2) can never be simultaneously satisfied for any ensembles in two 
dimensions. □ 
For the case of two states ($n = 2$) we have already noted that lemma 3 follows readily from
the explicit formula for von Neumann entropy which, for any $p_1$ and $p_2$, depends monotonically
on the overlap of the two states. Alternatively in this case we note that $r$ is a 2 by 2 matrix which
by (R1) and (R2) must take the form

$$r = \begin{pmatrix} 1 & e^{i\beta}\cos\alpha \\ e^{-i\beta}\cos\alpha & 1 \end{pmatrix}$$

This is always a positive matrix (as the eigenvalues $1 \pm \cos\alpha$ are both non-negative) and we then
apply lemma 2.

For general values of $n$ we introduce the linearised entropy $S_{\rm lin}$ defined by

$$S_{\rm lin} = \mathrm{Tr}\,(\rho - \rho^2)$$

Substituting (2) we get

$$S_{\rm lin} = \Big(1 - \sum_{i=1}^{n} p_i^2\Big) - 2\sum_{i<j} p_i p_j |\langle\psi_i|\psi_j\rangle|^2 \tag{7}$$

Hence $S_{\rm lin}$ is a monotonically decreasing function of each of the overlaps $|\langle\psi_i|\psi_j\rangle|$. Next we note
that for $d = 2$, the von Neumann entropy $S(\rho)$ is also a monotonically increasing function of $S_{\rm lin}$,
giving our claimed result that $S(\rho)$ is a monotonically decreasing function of each overlap. To
see that $S$ is monotonically increasing with $S_{\rm lin}$ for $d = 2$ let the eigenvalues of $\rho$ be $\lambda$ and $1 - \lambda$.
Then

$$S = -\lambda\log\lambda - (1 - \lambda)\log(1 - \lambda)$$

and

$$S_{\rm lin} = \lambda + (1 - \lambda) - \lambda^2 - (1 - \lambda)^2$$

Computing $\frac{dS}{d\lambda}$ and $\frac{dS_{\rm lin}}{d\lambda}$ shows that

$$\frac{dS}{dS_{\rm lin}} = \frac{dS}{d\lambda} \Big/ \frac{dS_{\rm lin}}{d\lambda} \ge 0$$
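
Both steps of this argument lend themselves to a quick numerical check. The following Python sketch is an added illustration; the three qubit states, the probabilities and the function names are arbitrary assumptions. It compares $\mathrm{Tr}(\rho - \rho^2)$ computed directly with the right-hand side of eq. (7), and then expresses $S$ as a function of $S_{\rm lin}$ for $d = 2$ by inverting $S_{\rm lin} = 2\lambda(1 - \lambda)$.

```python
import cmath
import math

def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def linear_entropy_direct(states, probs):
    """Tr(rho - rho^2) with rho built explicitly from the state vectors."""
    d = len(states[0])
    rho = [[sum(p * s[a] * s[b].conjugate() for s, p in zip(states, probs))
            for b in range(d)] for a in range(d)]
    tr_rho = sum(rho[a][a] for a in range(d)).real
    tr_rho2 = sum(rho[a][b] * rho[b][a]
                  for a in range(d) for b in range(d)).real
    return tr_rho - tr_rho2

def linear_entropy_overlaps(states, probs):
    """Right-hand side of eq. (7), using only probabilities and overlaps."""
    total = 1.0 - sum(p * p for p in probs)
    for i in range(len(states)):
        for j in range(i + 1, len(states)):
            overlap = abs(inner(states[i], states[j]))
            total -= 2.0 * probs[i] * probs[j] * overlap ** 2
    return total

def entropy_from_linear(s_lin):
    """For d = 2: solve S_lin = 2*lam*(1 - lam) for lam, return S in nats."""
    lam = (1.0 + math.sqrt(max(0.0, 1.0 - 2.0 * s_lin))) / 2.0
    return -sum(x * math.log(x) for x in (lam, 1.0 - lam) if x > 0.0)

# Three arbitrary (hypothetical) qubit states and prior probabilities.
states = [
    [1.0 + 0.0j, 0.0 + 0.0j],
    [complex(math.cos(0.5)), cmath.exp(0.7j) * math.sin(0.5)],
    [complex(math.cos(1.1)), cmath.exp(-0.4j) * math.sin(1.1)],
]
probs = [0.2, 0.3, 0.5]

direct = linear_entropy_direct(states, probs)
from_overlaps = linear_entropy_overlaps(states, probs)
print(direct, from_overlaps)  # the two values agree up to rounding
print([entropy_from_linear(x) for x in (0.1, 0.3, 0.5)])  # increasing
```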



4 Ensembles in three dimensions 

We will focus on the case that the inequalities in (ENS1) are all equalities:

$$|\langle\psi_i|\psi_j\rangle| = |\langle\tilde\psi_i|\tilde\psi_j\rangle| \quad\text{for all } i, j \tag{8}$$

This will readily imply our basic result, viz. that the entropy can be increased by increasing the
overlap of each pair of states. Indeed suppose that $\mathcal{E}$ and $\tilde{\mathcal{E}}$ are two different ensembles (with the
same prior probabilities) satisfying eq. (8) but with $S \ne \tilde S$. Without loss of generality suppose
that $S < \tilde S$. Now consider a small deformation of the states of $\tilde{\mathcal{E}}$ which slightly increases all the
pairwise overlaps, giving an ensemble $\mathcal{E}^*$ with entropy $S^*$. Then either $S^* \ge \tilde S$ (so $S^* > S$, giving
a direct example of our result) or $S^* < \tilde S$. In the latter case we will always have $S^* > S$ if the
deformation is sufficiently small, again giving an example of our result (with ensembles $\mathcal{E}$ and $\mathcal{E}^*$).

We will show that for almost any ensemble $\mathcal{E}$ of three linearly independent states (i.e. $n = 3$
and $d = 3$) it is possible to deform $\mathcal{E}$ in two ways such that:

(D1) all pairwise overlaps are increased and the entropy increases,

(D2) all pairwise overlaps are decreased and the entropy decreases.
For clarity in this section, it is useful to distinguish (normalised) state vectors, written as kets $|\psi\rangle$,
from physical states, which we will denote with square brackets as $[\psi]$. A physical state is a
full set of all normalised vectors that differ only in overall phase:

$$[\psi] = \{e^{i\phi}|\psi\rangle : 0 \le \phi < 2\pi\}$$

i.e. state vectors are elements of the Hilbert space whereas physical states are elements of the
projective Hilbert space. The vectors $e^{i\phi}|\psi\rangle$ in $[\psi]$ are called phase representatives of the state
$[\psi]$.

Consider three normalised vectors $|\psi_1\rangle, |\psi_2\rangle, |\psi_3\rangle$ in $\mathcal{H}_3$. We have the overlaps (non-negative
real numbers):

$$a_{12} = |\langle\psi_1|\psi_2\rangle| \qquad a_{23} = |\langle\psi_2|\psi_3\rangle| \qquad a_{31} = |\langle\psi_3|\psi_1\rangle|$$

and the triple quantity denoted $T_{123}$ (generally complex) defined as

$$T_{123} = \langle\psi_1|\psi_2\rangle\langle\psi_2|\psi_3\rangle\langle\psi_3|\psi_1\rangle = a_{12}a_{23}a_{31}e^{i\xi} \quad\text{for some phase } \xi$$

Note that the real numbers $a_{12}, a_{23}, a_{31}, \xi$ are well defined on physical states rather than just
on the vectors, i.e. if we arbitrarily change the phases of the vectors then these four numbers
remain invariant. Also for any prior probabilities, the density matrix and entropy of the ensemble
$\{|\psi_1\rangle, |\psi_2\rangle, |\psi_3\rangle; p_1, p_2, p_3\}$ is a function of the corresponding physical states. Note also that
unitary transformations are well defined on physical states and leave the quantities $a_{12}, a_{23}, a_{31}, \xi$
invariant.
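
The phase invariance of $T_{123}$ is easy to see numerically, since each vector enters the cyclic product once as a ket and once as a bra. The sketch below is an added illustration; the three vectors, the phases and the helper names are arbitrary choices for the example.

```python
import cmath
import math

def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def normalise(v):
    """Return v rescaled to unit length, with complex entries."""
    norm = math.sqrt(sum(abs(c) ** 2 for c in v))
    return [complex(c) / norm for c in v]

def cyclic_product(states):
    """<s1|s2><s2|s3>...<sk|s1>; for three states this is T_123."""
    k = len(states)
    result = 1.0 + 0.0j
    for i in range(k):
        result *= inner(states[i], states[(i + 1) % k])
    return result

# Three arbitrary (hypothetical) normalised vectors in three dimensions.
states = [normalise([1.0, 0.5j, 0.2]),
          normalise([0.3, 1.0, -0.4j]),
          normalise([0.7j, 0.1, 1.0])]

# Multiply each vector by an arbitrary overall phase.
phases = [0.3, -1.1, 2.5]
rephased = [[cmath.exp(1j * ph) * c for c in psi]
            for psi, ph in zip(states, phases)]

T = cyclic_product(states)
T_rephased = cyclic_product(rephased)
print(abs(T), cmath.phase(T))
print(abs(T_rephased), cmath.phase(T_rephased))  # same modulus and phase
```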

From the point of view of physics, we are primarily interested in ensembles of physical states
rather than ensembles of state vectors. In contrast to the overlaps $a_{ij}$ and $\xi$, the Gram matrix
depends on the choices of phase representatives. For any ensemble, given any choice of phase
representative of $[\psi_1]$ we may always choose representatives of $[\psi_2]$ and $[\psi_3]$ to make $\langle\psi_1|\psi_2\rangle$ and
$\langle\psi_1|\psi_3\rangle$ real and positive so the Gram matrix has the form:

$$G = \begin{pmatrix} p_1 & \sqrt{p_1 p_2}\,a_{12} & \sqrt{p_1 p_3}\,a_{31} \\ \sqrt{p_1 p_2}\,a_{12} & p_2 & \sqrt{p_2 p_3}\,a_{23}e^{i\xi} \\ \sqrt{p_1 p_3}\,a_{31} & \sqrt{p_2 p_3}\,a_{23}e^{-i\xi} & p_3 \end{pmatrix} \tag{9}$$

To any ensemble of physical states we will associate a Gram matrix of this form depending only
on the invariants $a_{ij}$, $\xi$ and the prior probabilities. Then if $\mathcal{E}$ and $\tilde{\mathcal{E}}$ have the same overlaps (as in
eq. (8)), the Gram matrices must be related as in eq. (6) by a matrix of multipliers $r$ of the form

$$r(\phi) = \begin{pmatrix} 1 & 1 & 1 \\ 1 & 1 & e^{i\phi} \\ 1 & e^{-i\phi} & 1 \end{pmatrix} \tag{10}$$

To study deformations preserving overlaps we begin by giving a characterisation of the set
of all possible triples of physical states $\{[\psi_1], [\psi_2], [\psi_3]\}$ compatible with a given prescribed set
$a_{12}, a_{23}, a_{31}$ of overlaps. We clearly have any overall unitary transformation of any allowed triple
but we are especially interested in triples that are not unitarily related. A complete
characterisation is given in the following theorem, whose proof is given in the appendix.

Theorem 1: Suppose we are given real numbers

$$0 < a_{12} < 1 \qquad 0 < a_{23} < 1 \qquad 0 < a_{31} < 1$$

(a) If $\{[\psi_1], [\psi_2], [\psi_3]\}$ is any set of physical states having $a_{12}, a_{23}, a_{31}$ as overlaps then the phase
$\xi$ of $T_{123}$ satisfies

$$1 + 2a_{12}a_{23}a_{31}\cos\xi \ge a_{12}^2 + a_{23}^2 + a_{31}^2 \tag{11}$$

Conversely, for any solution $\xi$ of eq. (11) there is a set of states having overlaps $a_{12}, a_{23}, a_{31}$
and triple quantity $T_{123} = a_{12}a_{23}a_{31}e^{i\xi}$. Thus the existence of a solution of eq. (11) is a
necessary and sufficient condition for a set of real numbers $a_{12}, a_{23}, a_{31}$ to be realisable as a
set of overlaps.

(b) The solutions $\xi$ of eq. (11) give a one-to-one parameterisation of all sets of states, up to
unitary equivalence, that have the prescribed overlaps. We always take $\xi$ to be in the
interval $[-\pi, \pi]$ so if eq. (11) has solutions they are always of the form $-\xi_{\max} \le \xi \le \xi_{\max}$
with $\xi_{\max} \le \pi$ (and we identify the values $\pm\pi$ if $\xi_{\max} = \pi$).

(c) (Explicit formulae up to unitary equivalence). Suppose that $\{[\psi_1], [\psi_2], [\psi_3]\}$ is any set of
physical states having $a_{12}, a_{23}, a_{31}$ as overlaps. Then there is an orthonormal basis $\{|0\rangle, |1\rangle, |2\rangle\}$
of $\mathcal{H}_3$ such that phase representatives of the states are given by:

$$\begin{aligned}
|\psi_1(\xi)\rangle &= |0\rangle \\
|\psi_2(\xi)\rangle &= a_{12}|0\rangle + \sqrt{1 - a_{12}^2}\,|1\rangle \\
|\psi_3(\xi)\rangle &= a_{31}|0\rangle + \frac{a_{23}e^{i\xi} - a_{12}a_{31}}{\sqrt{1 - a_{12}^2}}\,|1\rangle + \sqrt{\frac{1 - a_{12}^2 - a_{23}^2 - a_{31}^2 + 2a_{12}a_{23}a_{31}\cos\xi}{1 - a_{12}^2}}\,|2\rangle
\end{aligned} \tag{12}$$

Here $\xi$ is the phase of $T_{123}$. □
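
The explicit formulae in part (c) can be checked by building the three vectors and recomputing the overlaps and the phase of $T_{123}$. The following Python sketch is an added illustration; the function name `states_from_overlaps` and the sample overlap values are hypothetical. It also enforces condition (11) before constructing the states.

```python
import cmath
import math

def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def states_from_overlaps(a12, a23, a31, xi):
    """Phase representatives of eq. (12) for overlaps a_ij and phase xi.

    Raises ValueError if condition (11) fails, i.e. if no triple of
    states realises the requested overlaps with this phase.
    """
    lhs = 1.0 + 2.0 * a12 * a23 * a31 * math.cos(xi)
    rhs = a12 ** 2 + a23 ** 2 + a31 ** 2
    if lhs < rhs - 1e-12:
        raise ValueError("condition (11) is violated")
    s = math.sqrt(1.0 - a12 ** 2)
    psi1 = [1.0 + 0.0j, 0.0j, 0.0j]
    psi2 = [complex(a12), complex(s), 0.0j]
    psi3 = [complex(a31),
            (a23 * cmath.exp(1j * xi) - a12 * a31) / s,
            complex(math.sqrt(max(0.0, lhs - rhs)) / s)]
    return [psi1, psi2, psi3]

# Arbitrary sample values satisfying condition (11).
a12, a23, a31, xi = 0.5, 0.6, 0.7, 0.4
psi = states_from_overlaps(a12, a23, a31, xi)

overlaps = (abs(inner(psi[0], psi[1])),
            abs(inner(psi[1], psi[2])),
            abs(inner(psi[2], psi[0])))
T123 = inner(psi[0], psi[1]) * inner(psi[1], psi[2]) * inner(psi[2], psi[0])
print(overlaps)           # reproduces (a12, a23, a31) up to rounding
print(cmath.phase(T123))  # reproduces xi up to rounding
```

Calling the function with, for example, all three overlaps equal to 0.9 and $\xi = \pi$ raises the error, since inequality (11) then has no solution at that phase.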

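To illustrate theorem 1 and its significance for further developments we describe an example.

Example 1. Consider the ensemble $\mathcal{E} = \{|\alpha_1\rangle, |\alpha_2\rangle, |\alpha_3\rangle; \frac{1}{3}, \frac{1}{3}, \frac{1}{3}\}$ where

$$\begin{aligned}
|\alpha_1\rangle &= |0\rangle \\
|\alpha_2\rangle &= \tfrac{1}{\sqrt{2}}(|0\rangle + |1\rangle) \\
|\alpha_3\rangle &= \tfrac{1}{\sqrt{3}}(|0\rangle + |1\rangle + |2\rangle)
\end{aligned} \tag{13}$$

The Gram matrix has eigenvalues 0.053, 0.145 and 0.802 with entropy $S = 0.613$ (where natural
logarithms have been used). According to theorem 1, up to unitary equivalence any ensemble
with the same overlaps has the form $\mathcal{E}(\xi) = \{|\alpha_1(\xi)\rangle, |\alpha_2(\xi)\rangle, |\alpha_3(\xi)\rangle; \frac{1}{3}, \frac{1}{3}, \frac{1}{3}\}$ where

$$\begin{aligned}
|\alpha_1(\xi)\rangle &= |0\rangle \\
|\alpha_2(\xi)\rangle &= \tfrac{1}{\sqrt{2}}(|0\rangle + |1\rangle) \\
|\alpha_3(\xi)\rangle &= \tfrac{1}{\sqrt{3}}|0\rangle + \tfrac{2e^{i\xi} - 1}{\sqrt{3}}|1\rangle + \sqrt{\tfrac{4}{3}\cos\xi - 1}\;|2\rangle
\end{aligned} \tag{14}$$

Here the parameter $\xi$ is constrained by eq. (11) giving:

$$\cos\xi \ge \tfrac{3}{4} \quad\text{i.e.}\quad \xi_{\max} = 0.72 \text{ radians}$$

$G(\xi)$, the Gram matrix of $\mathcal{E}(\xi)$, is positive so long as $|\xi| \le \arccos\frac{3}{4}$. At the maximum value of $\xi$
the amplitude of $|2\rangle$ becomes zero and the states become linearly dependent.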

The von Neumann entropy of $\mathcal{E}(\xi)$ may be computed from the eigenvalues of $G(\xi)$. This is
shown in figure 1 and we see that the entropy falls monotonically with $\xi$ for the ensembles $\mathcal{E}(\xi)$
which have constant overlaps. (For negative values of $\xi$ the graph is reflected in the vertical axis).
Thus for any $0 < \xi < \arccos\frac{3}{4}$ we may deform $\mathcal{E}(\xi)$ to obtain an ensemble $\mathcal{E}^*$ with the same
overlaps but having strictly lower (respectively higher) entropy. Then by slightly deforming $\mathcal{E}^*$
to make all overlaps lower (respectively higher) we obtain a deformation of $\mathcal{E}(\xi)$ satisfying (D2)
(respectively (D1)). Note that if $\xi = 0$ (i.e. $T_{123}$ is real) then we only get (D2) (and not (D1))
by this method. Also for $\xi = \xi_{\max} = \arccos\frac{3}{4}$ we get only (D1) (by this method) as $\xi$ can only
be decreased. □

Figure 1: Variation of von Neumann entropy for the family of ensembles in eq. (14) with
unchanging overlaps, as a function of $\xi$, the phase of $T_{123}$. The maximum value of $\xi$ is
$\arccos\frac{3}{4} \approx 0.72$ radians.
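
The numbers quoted in Example 1 and the trend described for figure 1 can be reproduced with a short computation. The sketch below is an added illustration, with hypothetical function names: it builds the Gram matrix of form (9) for the overlaps of Example 1, finds its eigenvalues with the standard trigonometric formula for $3 \times 3$ Hermitian matrices, and tabulates the entropy (in nats) for several values of $\xi$ up to $\xi_{\max}$.

```python
import cmath
import math

def eig3(m):
    """Eigenvalues of a 3x3 Hermitian matrix via the trigonometric formula."""
    q = (m[0][0] + m[1][1] + m[2][2]).real / 3.0
    shifted = [[m[i][j] - (q if i == j else 0.0) for j in range(3)]
               for i in range(3)]
    p = math.sqrt(sum(abs(x) ** 2 for row in shifted for x in row) / 6.0)
    if p < 1e-15:
        return [q, q, q]
    b = [[x / p for x in row] for row in shifted]
    det_b = (b[0][0] * (b[1][1] * b[2][2] - b[1][2] * b[2][1])
             - b[0][1] * (b[1][0] * b[2][2] - b[1][2] * b[2][0])
             + b[0][2] * (b[1][0] * b[2][1] - b[1][1] * b[2][0])).real
    phi = math.acos(max(-1.0, min(1.0, det_b / 2.0))) / 3.0
    return [q + 2.0 * p * math.cos(phi + 2.0 * math.pi * k / 3.0)
            for k in range(3)]

def entropy(eigs):
    """Von Neumann entropy in nats from a list of eigenvalues."""
    return -sum(x * math.log(x) for x in eigs if x > 1e-15)

def gram(xi):
    """Gram matrix of form (9) for the overlaps of Example 1, p_i = 1/3."""
    a12, a23, a31 = 1 / math.sqrt(2), math.sqrt(2.0 / 3.0), 1 / math.sqrt(3)
    p = 1.0 / 3.0
    return [[p, p * a12, p * a31],
            [p * a12, p, p * a23 * cmath.exp(1j * xi)],
            [p * a31, p * a23 * cmath.exp(-1j * xi), p]]

xi_max = math.acos(0.75)  # from cos(xi) >= 3/4
eigs_at_zero = sorted(eig3(gram(0.0)))
samples = [0.0, 0.2, 0.4, 0.6, xi_max]
entropies = [entropy(eig3(gram(xi))) for xi in samples]

print([round(x, 3) for x in eigs_at_zero])  # [0.053, 0.145, 0.802]
for xi, s in zip(samples, entropies):
    print(round(xi, 3), round(s, 3))        # entropy falls as xi grows
```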

We now return to general ensembles of three states and study the variation of von Neumann
entropy with deformations that preserve the overlaps. According to Theorem 1 these deformations,
up to unitary equivalence, are parameterised by $\xi$. We will find that the behaviour exhibited in
example 1 is generic: the entropy falls monotonically as $\xi$ increases from 0 to $\xi_{\max}$ just as
in figure 1. Then the same deformation arguments as in example 1 give the following theorem:

Theorem 2: Suppose the ensemble $\mathcal{E} = \{|\psi_1\rangle, |\psi_2\rangle, |\psi_3\rangle; p_1, p_2, p_3\}$ has rank 3 (i.e. the
un-normalised states $\sqrt{p_i}\,|\psi_i\rangle$ are linearly independent) with overlaps $a_{12}, a_{23}, a_{31}$ and phase $\xi_0$ of
$T_{123}$.

(a) If $T_{123}$ is not real (i.e. $\xi_0 \ne 0, \pi$) then $\mathcal{E}$ can be deformed according to both (D1) and (D2).

(b) If $T_{123}$ is real positive (i.e. $\xi_0 = 0$) then $\mathcal{E}$ can be deformed according to (D2). If $T_{123}$ is
real negative (i.e. $\xi_0 = \pi$) then $\mathcal{E}$ can be deformed according to (D1). □

The proof of theorem 2 is given in the appendix.



Note that in theorem 2(b) we have not ruled out the possibility of a (D1) deformation
(respectively (D2) deformation) when $\mathcal{E}$ has $T_{123}$ real positive (respectively negative). We have
shown only that such deformations cannot be achieved by the particular method of first altering
the entropy while keeping the overlaps constant and then slightly increasing or decreasing the
overlaps. Indeed consider the ensemble T of states:

l^i) = |0> 

hM = ^(|o) + |i)) 
l^) = ^ i |o) + 4ti|i) 

taken with equal probabilities. Here we have $T_{123}$ real positive so our method cannot give a (D1)
deformation. However such deformations do exist, for example the ensemble T' of states:

W) = |o> 

hf*> = ^(|o> + |i» 

l^) = ^|0) + ^H) + ^|2) 

T' has overlaps greater than or equal to those of T yet its entropy $S' = 0.91$ is greater than the
entropy $S = 0.85$ of T, i.e. T' is a deformation of T of type (D1).

This leads us to conjecture that the conclusions in theorem 2(a) also hold for the ensembles in
(b) and more generally that any ensemble containing a subset of three states that has no parallel
or orthogonal pairs, admits deformations of both types (D1) and (D2). However it is possible
to show from our results that if such deformations always exist, they are not always realisable as
continuous deformations of the appropriate type starting with the given ensemble. Indeed let $\mathcal{E}$
be any ensemble with $T_{123}$ complex, consisting of three linearly dependent states, lying in a plane
$P$ (so that from theorem 1(c) the phase $\xi$ of $T_{123}$ must take the maximum value allowed by the
overlaps). From the result of section 3 we know that there are no deformations (continuous or
not) of type (D1) or (D2) lying within the plane $P$. Since $\mathcal{E}$ is planar, its density matrix has
a zero eigenvalue, say $\lambda_3 = 0$. The entropy function $S(\lambda_1, \lambda_2, \lambda_3)$ has infinite slope at $\lambda_3 = 0$ in
the $\lambda_3$ direction (and finite slopes in the $\lambda_1$ and $\lambda_2$ directions, for $\lambda_1, \lambda_2 > 0$) so any continuous
deformation of $\mathcal{E}$ out of the plane $P$ must begin to increase $S$, regardless of the overlaps. Hence
no continuous deformations, by any method whatever, of type (D2) can exist (yet there may
still exist a "distant" ensemble $\mathcal{E}'$ with decreased overlaps and decreased entropy, i.e. a (D2)
deformation, which cannot be connected to $\mathcal{E}$ by a continuous family of (D2) deformations).



In relation to theorem 1 it is interesting to note that in [17] Gisin and Popescu have described
another method for (discontinuous) deformation of a particular class of ensembles that preserves
all overlaps. Let $|\mathbf{n}\rangle$ denote the qubit state corresponding to the unit vector $\mathbf{n}$ on the Bloch
sphere. Let $\mathcal{E} = \{|\mathbf{n}_1\rangle|\mathbf{n}_1\rangle, \ldots, |\mathbf{n}_k\rangle|\mathbf{n}_k\rangle; p_1, \ldots, p_k\}$ be any ensemble of two qubit states where
each constituent state comprises two copies of a single qubit state. We now replace each second
component $|\mathbf{n}_i\rangle$ by the orthogonal state $|{-\mathbf{n}_i}\rangle$ to get $\mathcal{E}^* = \{|\mathbf{n}_1\rangle|{-\mathbf{n}_1}\rangle, \ldots, |\mathbf{n}_k\rangle|{-\mathbf{n}_k}\rangle; p_1, \ldots, p_k\}$.
Clearly $\mathcal{E}$ and $\mathcal{E}^*$ have the same pairwise overlaps but in general the von Neumann entropies
will be different. Indeed $\mathcal{E}$ always lies within the three dimensional subspace of symmetric states
whereas $\mathcal{E}^*$ generally spans the full four dimensions of two qubits. In [17] the ensemble $\mathcal{E}$ uniformly
distributed over the whole Bloch sphere was considered. It was shown that $\mathcal{E}^*$ allows one to guess
the identity of the direction $\mathbf{n}$ with greater average fidelity than is possible with the states from
$\mathcal{E}$. This provides another manifestation of the idea that the distinguishability in an ensemble can
vary while keeping all pairwise overlaps fixed.
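
The statement that the parallel and anti-parallel ensembles share all pairwise overlaps can be verified directly. The following Python sketch is an added illustration; the Bloch-sphere parameterisation, the sample directions and the function names are assumptions made for the example. It uses the product rule $\langle u_1 \otimes v_1|u_2 \otimes v_2\rangle = \langle u_1|u_2\rangle\langle v_1|v_2\rangle$.

```python
import cmath
import math

def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def qubit(theta, phi):
    """State |n> for the Bloch direction with polar angles (theta, phi)."""
    return [complex(math.cos(theta / 2.0)),
            cmath.exp(1j * phi) * math.sin(theta / 2.0)]

def antipode(theta, phi):
    """State |-n>, orthogonal to |n>, for the opposite Bloch direction."""
    return qubit(math.pi - theta, phi + math.pi)

def tensor(u, v):
    """Tensor product of two state vectors."""
    return [x * y for x in u for y in v]

# Three arbitrary (hypothetical) Bloch directions.
directions = [(0.3, 0.1), (1.2, 2.0), (2.4, -0.7)]

parallel = [tensor(qubit(t, f), qubit(t, f)) for t, f in directions]
antiparallel = [tensor(qubit(t, f), antipode(t, f)) for t, f in directions]

pairs = [(0, 1), (1, 2), (2, 0)]
overlaps_parallel = [abs(inner(parallel[i], parallel[j])) for i, j in pairs]
overlaps_anti = [abs(inner(antiparallel[i], antiparallel[j]))
                 for i, j in pairs]
print(overlaps_parallel)
print(overlaps_anti)  # agrees with the line above up to rounding
```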
5 Discussion

One of the initial motivations for this work was the problem of determining the optimal
compression of quantum information in mixed state signals [12, 13]. For pure states the optimal
compression is given by the von Neumann entropy of the source (and Schumacher compression [?]
provides an explicit asymptotically optimal compression protocol) but for mixed states the value is
unknown. One method of compressing the ensemble $\{\rho_1, \ldots, \rho_n; p_1, \ldots, p_n\}$ of mixed states (where
the compressor knows the identity of the signals) is to first prepare purifications $|\psi_i\rangle$ of the states
$\rho_i$ and then apply Schumacher compression to the pure state ensemble $\{|\psi_1\rangle, \ldots, |\psi_n\rangle; p_1, \ldots, p_n\}$.
Thus we wish to construct the ensemble of purifications that has the least von Neumann entropy.
For $n = 2$ the solution is given by the purifications with the largest overlap and the
corresponding minimum entropy can be readily calculated from Uhlmann's transition probability formula
[14, 15]. However for three or more states the results in this paper show that the maximum
overlap condition is no longer correct and the problem of minimising the entropy is evidently more
subtle. It has been shown in [16] that the problem of optimal mixed state compression, in full
generality, may be translated into a problem of minimising the entropy of a suitable ensemble of
purifications (of blocks of signal states).

From a mathematical viewpoint our results amount to an investigation of the von Neumann 
entropy S not as a function of a density matrix, but as a function of variables defining an ensemble: 

S = S(p) = S(J2pi (V?j|) = S(\l/)l) \lj) n ) ;Pl, ■ ■ ■ ,Pn)- 

In particular we have considered the behaviour of $S$ when the states $|\psi_i\rangle$ are varied while the
probabilities $p_i$ are held fixed. In this context many further interesting questions arise. For
example, given an ensemble, what is the largest entropy that can be attained by deformations
that make the states more parallel? Conversely given a value $S_0$ of von Neumann entropy, what
is the maximum possible (average) overlap of any ensemble that has entropy $S_0$? Are there
ensembles with the property that every continuous deformation which increases overlaps begins
to increase the entropy? We may also consider varying the probabilities. Suppose we have two
ensembles $\mathcal{E} = \{|\psi_1\rangle, |\psi_2\rangle, |\psi_3\rangle; p_1, p_2, p_3\}$ and $\mathcal{E}^* = \{|\phi_1\rangle, |\phi_2\rangle, |\phi_3\rangle; q_1, q_2, q_3\}$ where the states
of $\mathcal{E}^*$ have greater pairwise overlap than the corresponding states of $\mathcal{E}$. Let cap($\mathcal{E}$) be the maximum
entropy attainable from the states in $\mathcal{E}$ by varying the prior probabilities. Is it possible to have
cap($\mathcal{E}^*$) > cap($\mathcal{E}$)? This question is of particular interest since cap($\mathcal{E}$) is the classical information
capacity of a quantum channel using the states of $\mathcal{E}$ as basic signal states.
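As a concrete illustration of treating $S$ as a function of the ensemble variables rather than of a density matrix, the following minimal Python sketch evaluates the entropy directly from a list of state vectors and prior probabilities. The function name `ensemble_entropy`, the row-per-state array layout and the choice of bits as the unit are assumptions made for this example only.

```python
import numpy as np

def ensemble_entropy(states, probs):
    """Von Neumann entropy (in bits) of rho = sum_i p_i |psi_i><psi_i|.

    states: array of shape (n, d), one normalised state vector per row.
    probs:  array of shape (n,), prior probabilities summing to 1.
    """
    states = np.asarray(states, dtype=complex)
    probs = np.asarray(probs, dtype=float)
    # rho_jk = sum_i p_i psi_i[j] conj(psi_i[k])
    rho = np.einsum("i,ij,ik->jk", probs, states, states.conj())
    eigvals = np.linalg.eigvalsh(rho)
    eigvals = eigvals[eigvals > 1e-12]  # drop numerical zeros (0 log 0 = 0)
    return float(-np.sum(eigvals * np.log2(eigvals)))
```

Holding `probs` fixed and varying `states` then reproduces the kind of deformation studied in the text.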

In the introduction we argued that von Neumann entropy quantifies a notion of distinguisha- 
bility for the constituent states of an ensemble and pointed out that it is surprising that dis- 
tinguishability (as well as the lower limit for compressibility) may be increased by increasing all 
pairwise overlaps (for ensembles of three or more states). Thus the notion of distinguishability and 
the redundancy involved in quantum information compression depend not only on the overlaps 
but also on the relative phases of the amplitudes $\langle\psi_i|\psi_j\rangle$. For ensembles with just two states there
is only one such phase and it is rendered physically irrelevant by the overall phase freedom in a 
physical quantum state. For three or more states the relative phases cannot be eliminated and 
they provide further parameters for issues of compressibility and distinguishability. For general 
ensembles we can consider the quantities: 

$$T(i_1, i_2, \ldots, i_k) = \langle\psi_{i_1}|\psi_{i_2}\rangle \langle\psi_{i_2}|\psi_{i_3}\rangle \cdots \langle\psi_{i_{k-1}}|\psi_{i_k}\rangle \langle\psi_{i_k}|\psi_{i_1}\rangle \tag{15}$$

associated to each subset $[\psi_{i_1}], \ldots, [\psi_{i_k}]$ of physical states (noting the cyclic chain of indices
$i_1, \ldots, i_k, i_1$ returning to $i_1$ to complete the cycle). These quantities are all unitary invariants and
independent of the choice of phase representatives. Chains of length 1 define the normalisation
while chains of length 2 are just (the squares of) the pairwise overlaps. The modulus of any chain
of length $k$ is a product of overlaps but its phase is a new quantity. For example for three states, the
phase of any chain of length 3 is (up to a sign) the parameter $\xi$ considered previously. It would be
interesting to understand the physical bearing of these parameters on issues of distinguishability
and compressibility.
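The independence of these cyclic products from the choice of phase representatives can be checked numerically. The sketch below is illustrative only: the helper name `chain`, the random test states and the zero-based indices are assumptions of the example, not notation from the paper.

```python
import numpy as np

def chain(states, indices):
    """Cyclic product <psi_i1|psi_i2><psi_i2|psi_i3>...<psi_ik|psi_i1>."""
    value = 1.0 + 0.0j
    k = len(indices)
    for m in range(k):
        a = states[indices[m]]
        b = states[indices[(m + 1) % k]]
        value *= np.vdot(a, b)  # vdot conjugates its first argument
    return value

rng = np.random.default_rng(0)
psi = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
psi /= np.linalg.norm(psi, axis=1, keepdims=True)  # one normalised state per row

# Re-phase every state independently; the chain should not change.
phases = np.exp(1j * rng.uniform(0, 2 * np.pi, size=3))
psi_rephased = phases[:, None] * psi

t123 = chain(psi, [0, 1, 2])
t123_rephased = chain(psi_rephased, [0, 1, 2])
```

Each state enters a chain once as a bra and once as a ket, so its phase cancels; `np.angle(t123)` is then the extra parameter that the pairwise overlaps alone do not fix.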

We have seen that a family of ensembles with fixed overlaps can exhibit a variation of infor- 
mation content. In particular for ensembles of three states we have the extra quantum mechanical 
phase parameter $\xi$. It is interesting to note that an analogous phenomenon may occur in a purely
classical context [19]. Suppose Alice wishes to communicate information to Bob using three sig- 
nals A,B,C with equal prior probabilities. The signals themselves are probability distributions on 
three values {1,2,3}. A is the uniform distribution on {1,2}, B on {2,3} and C on {1,3}. To 
send a signal, Alice samples the corresponding distribution and sends the result e.g. to send B 
she tosses a fair coin labelled by 2 and 3, and sends the outcome. (Alternatively we may attribute 
the probabilistic nature of the signals to noise in the channel). Bob then reads the received value. 
We may readily calculate the probability $p(y|X)$ that Bob reads $y$ (1, 2 or 3) given that Alice
sent $X$ (A, B or C) and the mutual information $I(X : Y)$ between Alice and Bob. In classical
information theory, $I(X : Y)$ quantifies the amount of information that Bob gets about Alice's
message i.e. the information capacity of the communication protocol. Consider now the same
scenario with three new signals A', B' and C' which are probability distributions on four values
{1,2,3,4}. A' is the uniform distribution on {1,4}, B' on {2,4} and C' on {3,4}. The mutual
information is now different but in any reasonable sense, the signals A', B' and C' have the same
pairwise overlaps as the corresponding signals from the original set A,B,C - in every case the 
two distributions coincide on one value and are disjoint on the other value. Thus in this purely 
classical context we again have the phenomenon that the global information content (in the sense 
of mutual information here) differs even though the pairwise overlaps (and pairwise information 
contents) are the same. 
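The two mutual informations in this classical example can be evaluated explicitly. The following sketch assumes equal priors of 1/3 as stated above; the function name `mutual_information` and the array layout (one row per signal, one column per received value) are choices made for the illustration.

```python
import numpy as np

def mutual_information(prior, channel):
    """I(X:Y) in bits for prior p(x) and channel[x, y] = p(y|x)."""
    prior = np.asarray(prior, dtype=float)
    channel = np.asarray(channel, dtype=float)
    joint = prior[:, None] * channel           # p(x, y)
    p_y = joint.sum(axis=0)                    # marginal of Bob's reading
    indep = prior[:, None] * p_y[None, :]      # p(x) p(y)
    mask = joint > 0                           # convention 0 log 0 = 0
    return float(np.sum(joint[mask] * np.log2(joint[mask] / indep[mask])))

prior = [1/3, 1/3, 1/3]
# Rows A, B, C; columns are the values 1, 2, 3.
first = [[0.5, 0.5, 0.0],
         [0.0, 0.5, 0.5],
         [0.5, 0.0, 0.5]]
# Rows A', B', C'; columns are the values 1, 2, 3, 4.
second = [[0.5, 0.0, 0.0, 0.5],
          [0.0, 0.5, 0.0, 0.5],
          [0.0, 0.0, 0.5, 0.5]]
i_first = mutual_information(prior, first)
i_second = mutual_information(prior, second)
```

For the first set the result is $\log_2 3 - 1 \approx 0.585$ bits and for the second $\frac{1}{2}\log_2 3 \approx 0.792$ bits, so the two signal sets carry different mutual information.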

There are various other possible natural concepts of distinguishability which one can associate
to an ensemble $\{|\psi_1\rangle, \ldots, |\psi_n\rangle; p_1, \ldots, p_n\}$ of pure quantum states. Two such concepts are
the accessible information and the minimum error probability [18]. For the latter, we attempt
to identify the state by applying a measurement $\mathcal{M}$ with outcomes $1, 2, \ldots, n$. Let $p(j|i)$ be
the probability of obtaining outcome $j$ for the input state $|\psi_i\rangle$. The error probability is defined
by $P_{\rm error} = \sum_i p_i (1 - p(i|i))$. We choose $\mathcal{M}$ to minimise $P_{\rm error}$ and use that minimum value
as a measure of distinguishability for the ensemble (where a smaller value indicates greater
distinguishability). The accessible information of an ensemble is the maximum Shannon mutual
information of any POVM measurement on the ensemble states. For $n = 2$ both of these measures
are monotonic functions of the overlap $|\langle\psi_1|\psi_2\rangle|$ but for $n \ge 3$, like the von Neumann
entropy, they are not functions of the overlaps alone. Thus it seems plausible that they too, like
the von Neumann entropy, will exhibit increased distinguishability under suitable deformations
of the ensemble that increase all pairwise overlaps. But unlike the von Neumann entropy this
is difficult to study analytically: the computation of minimum error probability or accessible
information involves difficult optimisations and is generally intractable analytically for all but the
simplest ensembles.
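For $n = 2$ the monotonic dependence of the minimum error probability on the overlap can be made explicit. The sketch below uses the standard Helstrom closed form for two pure states, which is quoted here as an outside result and is not derived in this paper; the function name `helstrom_error` is an assumption of the example.

```python
import math

def helstrom_error(p1, overlap):
    """Minimum error probability for two pure states with priors p1 and 1 - p1
    and overlap |<psi_1|psi_2>| (Helstrom's two-state formula)."""
    p2 = 1.0 - p1
    # Guard against tiny negative values caused by rounding.
    radicand = max(0.0, 1.0 - 4.0 * p1 * p2 * overlap ** 2)
    return 0.5 * (1.0 - math.sqrt(radicand))

# Error probability on a grid of overlaps from 0 (orthogonal) to 1 (identical).
errors = [helstrom_error(0.5, x / 10) for x in range(11)]
```

The list `errors` increases with the overlap, from 0 for orthogonal states up to 1/2 for identical states with equal priors. No comparable closed form is assumed here for three or more states.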
6 Appendix

6.1 Proof of Lemma 1 

Suppose that the two ensembles are unitarily related. Then it is easy to see that they have the 
same Gram matrices (as the inner products are unitary invariants). 

Conversely suppose that $G_1 = G_2$. Then the number of states and corresponding probabilities
(being the diagonal of the Gram matrix) must be the same. So write

$$\mathcal{E}_1 = \{|\alpha_1\rangle, \ldots, |\alpha_k\rangle; p_1, \ldots, p_k\}$$

and

$$\mathcal{E}_2 = \{|\beta_1\rangle, \ldots, |\beta_k\rangle; p_1, \ldots, p_k\}$$

We will use proof by induction on $k$ to show that $|\beta_i\rangle = U|\alpha_i\rangle$ for some unitary $U$. The result is
clearly true for $k = 1$ i.e. ensembles with only one state. Assume (the induction hypothesis) that
it is true for all ensembles of $k$ states. Consider two ensembles of $k + 1$ states:

$$\mathcal{E}_1 = \{|\alpha_1\rangle, \ldots, |\alpha_k\rangle, |\alpha_{k+1}\rangle; p_1, \ldots, p_k, p_{k+1}\}$$

$$\mathcal{E}_2 = \{|\beta_1\rangle, \ldots, |\beta_k\rangle, |\beta_{k+1}\rangle; p_1, \ldots, p_k, p_{k+1}\}$$

with the same Gram matrices. Then the sub-ensembles of just the first $k$ states (with probabilities
rescaled by $1/(1 - p_{k+1})$) will have the same Gram matrices so by the induction hypothesis there
is a unitary $U$ with

$$|\beta_i\rangle = U|\alpha_i\rangle \quad \text{for } i = 1, \ldots, k$$

Now compare the ensembles $U(\mathcal{E}_1)$ and $\mathcal{E}_2$. They differ only in their $(k+1)$th states which
are respectively $U|\alpha_{k+1}\rangle$ and $|\beta_{k+1}\rangle$. Let $B = \mathrm{span}(|\beta_1\rangle, \ldots, |\beta_k\rangle)$. Consider the parallel and
perpendicular projections with respect to $B$:

$$U|\alpha_{k+1}\rangle = (U\alpha)_\parallel + (U\alpha)_\perp$$

$$|\beta_{k+1}\rangle = (\beta)_\parallel + (\beta)_\perp$$

Since $\mathcal{E}_1$ and $\mathcal{E}_2$ (and hence also $U(\mathcal{E}_1)$) have equal Gram matrices, $U|\alpha_{k+1}\rangle$ and $|\beta_{k+1}\rangle$ have equal
inner products with $|\beta_1\rangle, \ldots, |\beta_k\rangle$ and hence with a basis of $B$. Thus the parallel projections $(U\alpha)_\parallel$
and $(\beta)_\parallel$ are equal and so the perpendicular projections have the same length. If this length is
zero then $\mathcal{E}_2 = U\mathcal{E}_1$. If it is not zero, let $B^\perp$ be the two dimensional space spanned by $(U\alpha)_\perp$ and
$(\beta)_\perp$ and let $V$ be a unitary transformation in $B^\perp$ with $V(U\alpha)_\perp = (\beta)_\perp$. Finally let $U'$ be the
unitary transformation which is the identity in $B$ and $V$ in $B^\perp$. Then $\mathcal{E}_2 = U'U\mathcal{E}_1$, completing
the proof of the lemma. □
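The converse direction of the lemma can be illustrated numerically in the special case of three linearly independent states in three dimensions. This is only a sketch under that assumption: instead of the inductive construction of the proof it builds a second set of states with the same inner products from a Cholesky factor and then recovers the connecting map by matrix inversion. All names (`A`, `B`, `gram`, `U`) and the chosen probabilities are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

# Columns of A are three random normalised states in C^3
# (linearly independent with probability one).
A = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
A /= np.linalg.norm(A, axis=0, keepdims=True)
probs = np.array([0.5, 0.3, 0.2])

def gram(states, probs):
    """G_ij = sqrt(p_i p_j) <psi_i|psi_j> for states stored as columns."""
    weighted = states * np.sqrt(probs)  # scales column j by sqrt(p_j)
    return weighted.conj().T @ weighted

# A second set of states with the same inner products, obtained from a
# Cholesky factor: A^dagger A = L L^dagger, so B = L^dagger has B^dagger B = A^dagger A.
L = np.linalg.cholesky(A.conj().T @ A)
B = L.conj().T

# Candidate map sending each column of A to the matching column of B.
U = B @ np.linalg.inv(A)
```

Because the two Gram matrices agree, `U` comes out unitary (up to rounding), in line with the statement of the lemma.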

6.2 Proof of Theorem 1 

For any set of states $\{[\psi_1], [\psi_2], [\psi_3]\}$ we can always choose phase representatives making $\langle\psi_1|\psi_2\rangle$
and $\langle\psi_1|\psi_3\rangle$ real non-negative. Hence:

$\{[\psi_1], [\psi_2], [\psi_3]\}$ will have overlaps $a_{12}, a_{23}, a_{31}$

iff there are representatives with

$$\langle\psi_1|\psi_2\rangle = a_{12}, \quad \langle\psi_1|\psi_3\rangle = a_{31} \quad \text{and} \quad \langle\psi_2|\psi_3\rangle = a_{23}e^{i\xi} \quad \text{for some } \xi \tag{16}$$

iff

$$G(\xi) \text{ defined by } \frac{1}{3}\begin{pmatrix} 1 & a_{12} & a_{31} \\ a_{12} & 1 & a_{23}e^{i\xi} \\ a_{31} & a_{23}e^{-i\xi} & 1 \end{pmatrix} \text{ is a Gram matrix} \tag{17}$$

iff $3G$ is a positive matrix (where $G$ is the matrix in eq. (17))
iff the eigenvalues $\lambda_1, \lambda_2, \lambda_3$ of $3G$ are all non-negative.

Next we claim (and prove in the next paragraph) that the last condition holds iff $\det 3G \ge 0$.
Then computing $\det 3G$ directly from $G$ above gives the required condition (11). Note also from
(16) that $\xi$ is the phase of the triple quantity $T_{123}$ and this completes the proof of (a).

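To justify the claim above, we show that $3G$ cannot have exactly two negative eigenvalues.
Consider $\beta = \lambda_1\lambda_2 + \lambda_2\lambda_3 + \lambda_3\lambda_1$ and suppose that there are two negative eigenvalues $\lambda_1$ and $\lambda_2$.
Then $\mathrm{Tr}\, 3G = 3 = \lambda_1 + \lambda_2 + \lambda_3$ gives

$$\beta = \lambda_1\lambda_2 + (\lambda_1 + \lambda_2)(3 - (\lambda_1 + \lambda_2)) = -\lambda_1^2 - \lambda_2^2 - \lambda_1\lambda_2 + 3(\lambda_1 + \lambda_2)$$

Thus $\beta < 0$ as all terms are negative. But $\beta$ is the linear coefficient in the characteristic equation
of $3G$ and a direct calculation of $\det(\lambda I - 3G)$ gives

$$\beta = 3 - a_{12}^2 - a_{23}^2 - a_{31}^2$$

so that $\beta \ge 0$. Hence $3G$ can never have exactly two negative eigenvalues. Since $\mathrm{Tr}\, 3G = 3 > 0$
also excludes three negative eigenvalues, the non-negativity of $\det 3G = \lambda_1\lambda_2\lambda_3$ is equivalent
to the non-negativity of all three eigenvalues.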
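The equivalence between positivity of $3G$ and the sign of its determinant can be spot-checked numerically. The sketch below assumes that the determinant of the matrix in eq. (17), without the factor 1/3, expands to $1 - a_{12}^2 - a_{23}^2 - a_{31}^2 + 2a_{12}a_{23}a_{31}\cos\xi$ and verifies that expansion as part of the test; the function names and the random sampling scheme are illustrative choices.

```python
import numpy as np

def three_g(a12, a23, a31, xi):
    """The matrix 3G, i.e. eq. (17) without the factor 1/3."""
    return np.array([
        [1.0, a12, a31],
        [a12, 1.0, a23 * np.exp(1j * xi)],
        [a31, a23 * np.exp(-1j * xi), 1.0],
    ])

def det_formula(a12, a23, a31, xi):
    """Closed-form determinant of 3G."""
    return 1 - a12**2 - a23**2 - a31**2 + 2 * a12 * a23 * a31 * np.cos(xi)

rng = np.random.default_rng(2)
agree = True
for _ in range(2000):
    a12, a23, a31 = rng.uniform(0, 1, size=3)
    xi = rng.uniform(-np.pi, np.pi)
    m = three_g(a12, a23, a31, xi)
    d = det_formula(a12, a23, a31, xi)
    if abs(d) < 1e-6:
        continue  # skip borderline cases dominated by rounding
    positive = np.linalg.eigvalsh(m).min() >= -1e-9
    # The closed form must match the numerical determinant, and positivity
    # of 3G must coincide with a positive determinant.
    agree &= bool(np.isclose(np.linalg.det(m).real, d)) and (bool(positive) == bool(d > 0))
```

After the loop `agree` remains true, consistent with the claim that exactly two negative eigenvalues never occur for overlaps in [0, 1].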

To prove (b) we recall that $T_{123}$, and hence $\xi$, is an invariant of any overall unitary
transformation. Hence for fixed $a_{12}, a_{23}$ and $a_{31}$, different $\xi$'s correspond to unitarily inequivalent sets of
states.

To prove (c) suppose that $\{[\psi_1], [\psi_2], [\psi_3]\}$ is any set of states with the given overlaps. We
choose an orthonormal basis $\{|0\rangle, |1\rangle, |2\rangle\}$ of $\mathcal{H}_3$ and phase representatives $|\psi_i\rangle$ as follows. Choose
an arbitrary phase representative $|\psi_1\rangle$ of $[\psi_1]$ and set

$$|0\rangle = |\psi_1\rangle$$

Choose $|1\rangle$ orthogonal to $|0\rangle$ in the plane of $[\psi_1]$ and $[\psi_2]$ (with phase of $|1\rangle$ to be fixed later).
Then any representative $|\psi_2\rangle$ of $[\psi_2]$ has the form

$$|\psi_2\rangle = a_{12}e^{i\alpha}|0\rangle + \beta|1\rangle$$

Choose the overall phase of $|\psi_2\rangle$ to make $\alpha = 0$ and choose the phase of $|1\rangle$ to make $\beta$ real and
non-negative, so

$$|\psi_2\rangle = a_{12}|0\rangle + \sqrt{1 - a_{12}^2}\,|1\rangle$$

Then choose $|2\rangle$ orthonormal to $|0\rangle$ and $|1\rangle$ (with phase to be fixed later) so any representative of
$[\psi_3]$ has the form

$$|\psi_3\rangle = \omega|0\rangle + \eta|1\rangle + \zeta|2\rangle$$

Choose the overall phase of $|\psi_3\rangle$ to make $\omega$ real non-negative so $\omega = a_{31}$. Choose the phase of $|2\rangle$
to make $\zeta$ real non-negative and denote it $z$. Then

$$|\psi_3\rangle = a_{31}|0\rangle + \eta|1\rangle + z|2\rangle \quad \text{with } z \text{ real} \ge 0,\ \eta \text{ complex} \tag{18}$$

We have two further conditions:

$$|\langle\psi_2|\psi_3\rangle| = a_{23} = \left|a_{31}a_{12} + \sqrt{1 - a_{12}^2}\,\eta\right| \tag{19}$$

and normalisation:

$$z^2 = 1 - |\eta|^2 - a_{31}^2 \tag{20}$$



From eq. (19) we introduce $\xi$ so that

$$a_{31}a_{12} + \sqrt{1 - a_{12}^2}\,\eta = a_{23}e^{i\xi}$$

This gives $\eta$ parameterised by $\xi$:

$$\eta = \frac{a_{23}e^{i\xi} - a_{31}a_{12}}{\sqrt{1 - a_{12}^2}} \tag{21}$$

Then eq. (20) gives

$$z^2 = \frac{1 - a_{12}^2 - a_{23}^2 - a_{31}^2 + 2a_{12}a_{23}a_{31}\cos\xi}{1 - a_{12}^2} = \frac{1}{1 - a_{12}^2}\det\begin{pmatrix} 1 & a_{12} & a_{31} \\ a_{12} & 1 & e^{i\xi}a_{23} \\ a_{31} & e^{-i\xi}a_{23} & 1 \end{pmatrix} \tag{22}$$

Substituting eq. (21) and eq. (22) into eq. (18) gives the required form of $|\psi_3\rangle$. Note that the
condition $z^2 \ge 0$ in eq. (22) reproduces the condition (11) in part (a) of the theorem. □
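The construction in part (c) translates directly into code. The following sketch builds the three representatives in the basis $\{|0\rangle, |1\rangle, |2\rangle\}$ from the overlaps and the phase $\xi$, assuming $a_{12} < 1$ so that the denominators are non-zero; the function name `three_states` and the sample values are hypothetical.

```python
import numpy as np

def three_states(a12, a23, a31, xi):
    """Representatives |psi_1>, |psi_2>, |psi_3> in the basis {|0>, |1>, |2>}."""
    # eta as a function of xi, obtained from the condition on <psi_2|psi_3>.
    eta = (a23 * np.exp(1j * xi) - a31 * a12) / np.sqrt(1 - a12**2)
    # z^2 from normalisation of |psi_3>.
    z_sq = (1 - a12**2 - a23**2 - a31**2
            + 2 * a12 * a23 * a31 * np.cos(xi)) / (1 - a12**2)
    if z_sq < 0:
        raise ValueError("no states exist for these overlaps and this phase")
    psi1 = np.array([1, 0, 0], dtype=complex)
    psi2 = np.array([a12, np.sqrt(1 - a12**2), 0], dtype=complex)
    psi3 = np.array([a31, eta, np.sqrt(z_sq)], dtype=complex)
    return psi1, psi2, psi3

p1, p2, p3 = three_states(0.5, 0.6, 0.4, 0.7)
# Triple product <psi_1|psi_2><psi_2|psi_3><psi_3|psi_1>; its phase should be xi.
triple = np.vdot(p1, p2) * np.vdot(p2, p3) * np.vdot(p3, p1)
```

The resulting vectors have the requested overlaps, and the phase of `triple` equals the chosen $\xi$.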



6.3 Proof of Theorem 2 

Given the ensemble $\mathcal{E} = \{|\psi_1\rangle, |\psi_2\rangle, |\psi_3\rangle; p_1, p_2, p_3\}$ let $\mathcal{E}(\xi)$ denote the family of ensembles (up
to unitary equivalence) which have the same overlaps as $\mathcal{E}$. Here $\xi$ is the phase of $T_{123}$. We study
the variation of $S$ with $\xi$ through a sequence of lemmas (with proofs given at the end):

Lemma A1: The ensembles $\mathcal{E}(\xi)$ have constant $\mathrm{Tr}\,\rho$ and constant $\mathrm{Tr}\,\rho^2$. Hence $S$ may be
viewed as a function of $\mathrm{Tr}\,\rho^3 = \mathrm{Tr}\,G^3$.

Lemma A2: If $\mathrm{Tr}\,\rho$ and $\mathrm{Tr}\,\rho^2$ are held constant then $S$ is a monotonically increasing function
of $\mathrm{Tr}\,\rho^3$.

Lemma A3: For any ensemble $\mathcal{E}(\xi)$, $\mathrm{Tr}\,\rho^3$ has the form

$$\mathrm{Tr}\,\rho^3 = C + 6p_1p_2p_3\,\mathrm{Re}\,T_{123} = C + 6p_1p_2p_3\,a_{12}a_{23}a_{31}\cos\xi \tag{23}$$

where $C$ is independent of $\xi$.

In view of these lemmas we have: 

Proof of theorem 2: We consider the family of ensembles $\mathcal{E}(\xi)$ with the same overlaps as
$\mathcal{E}$, parameterised by $\xi$, the phase of $T_{123}$.

(a) If $T_{123}$ is not real and $\xi_0 \neq \pm\xi_{\max}$ (as $\mathcal{E}$ has rank 3) then we can perturb $\xi_0$ slightly in both the
positive and negative directions inducing either an increase or decrease in $\mathrm{Tr}\,\rho^3$ (by lemma A3).
Hence by lemmas A1 and A2 we may either increase or decrease $S$ while keeping the overlaps the
same, giving the possibilities (D1) and (D2).

(b) If $\xi_0 = 0$ then $\cos\xi_0$ is at its maximum value. Thus any perturbation of $\xi_0$ leads to a decrease
in $\mathrm{Tr}\,\rho^3$ and hence a decrease in $S$ (while keeping all overlaps constant). Thus we get (D2) only.
At $\xi_0 = \pi$, $\cos\xi_0$ is minimum so similarly, we get only (D1). □
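The behaviour described in this proof can be observed numerically by scanning $\xi$ at fixed overlaps and probabilities. The sketch below works with the Gram matrix $G_{ij} = \sqrt{p_ip_j}\langle\psi_i|\psi_j\rangle$, whose eigenvalues are those of $\rho$ for three states in three dimensions; the sample overlaps, probabilities and range of $\xi$ are arbitrary choices for illustration.

```python
import numpy as np

def gram(probs, a12, a23, a31, xi):
    """G_ij = sqrt(p_i p_j) <psi_i|psi_j> with <psi_2|psi_3> = a23 exp(i xi)."""
    overlaps = np.array([
        [1.0, a12, a31],
        [a12, 1.0, a23 * np.exp(1j * xi)],
        [a31, a23 * np.exp(-1j * xi), 1.0],
    ])
    s = np.sqrt(np.asarray(probs, dtype=float))
    return overlaps * np.outer(s, s)

def entropy(g):
    """Von Neumann entropy in bits from the eigenvalues of the Gram matrix."""
    lam = np.linalg.eigvalsh(g)
    lam = lam[lam > 1e-12]
    return float(-np.sum(lam * np.log2(lam)))

probs, a12, a23, a31 = [0.5, 0.3, 0.2], 0.5, 0.6, 0.4
xis = np.linspace(0.0, 1.0, 6)   # inside the allowed range for these overlaps
grams = [gram(probs, a12, a23, a31, xi) for xi in xis]
tr2 = [np.trace(g @ g).real for g in grams]        # stays constant
tr3 = [np.trace(g @ g @ g).real for g in grams]    # decreases as cos(xi) decreases
ent = [entropy(g) for g in grams]                  # decreases along with tr3
```

Moving $\xi$ away from 0 lowers both the cubic trace and the entropy while every pairwise overlap stays fixed, as in case (b).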

Proof of Lemma A1: We have $\mathrm{Tr}\,\rho = 1$ for all $\xi$. Also $\mathrm{Tr}\,\rho^2 = \mathrm{Tr}\,G^2 = \sum_{ij} G_{ij}G_{ji} =
\sum_{ij} |G_{ij}|^2$ as $G$ is Hermitian. Since the ensembles $\mathcal{E}(\xi)$ have constant overlaps, the Gram matrices are
related by Hadamard product with a matrix $r$ of the form in eq. (10) with $|r_{ij}| = 1$ for all $i, j$.
Hence $\mathrm{Tr}\,\rho^2$ remains constant. The three eigenvalues of $\rho$ are uniquely determined by the values
of $\mathrm{Tr}\,\rho$, $\mathrm{Tr}\,\rho^2$ and $\mathrm{Tr}\,\rho^3$ so $S$ may be viewed as a function of $\mathrm{Tr}\,\rho^3$. □
Figure 2: The triangle XYZ, with vertices X = (1,0,0), Y = (0,1,0) and Z = (0,0,1), is the probability
simplex $V_3$ (or space of diagonal density matrices $\mathrm{diag}(\lambda_1, \lambda_2, \lambda_3)$) in $d = 3$ dimensions. It is
obtained by intersecting the plane $\lambda_1 + \lambda_2 + \lambda_3 = 1$ with the positive octant in the space of all
triples $(\lambda_1, \lambda_2, \lambda_3)$. The three vertices are the three pure states and the edges are rank 2 diagonal
states. The central point $M = \mathrm{diag}(\frac13, \frac13, \frac13)$ is the maximally mixed state. Along lines MX, MY
and MZ two eigenvalues are equal. We introduce polar co-ordinates on this simplex with $r$ being the
distance from $M$ and the polar angle $\theta$ is measured anticlockwise from the line MX. The co-ordinates
for a general state are given in eq. (24). The arc $\alpha\beta$ includes all diagonal states $\rho$ (up to a
permutation of the diagonal entries) at a constant radius $r$ with a fixed $\mathrm{Tr}\,\rho^2 = \frac13 + r^2$ (cf eq. (26) et seq.).



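Proof of Lemma A2: Note first that $S$ is unitarily invariant so we may assume without loss
of generality that $\rho$ is diagonal, and work on the classical probability simplex

$$V_3 = \{(\lambda_1, \lambda_2, \lambda_3) : \lambda_1 \ge 0,\ \lambda_2 \ge 0,\ \lambda_3 \ge 0 \text{ and } \lambda_1 + \lambda_2 + \lambda_3 = 1\}$$

To represent the constraint $\mathrm{Tr}\,\rho^2 = \text{const}$ it is convenient to introduce a polar co-ordinate system
$(r, \theta)$ in $V_3$ as shown in figure 2. $r$ is measured from the centre $M = (\frac13, \frac13, \frac13)$ of $V_3$ and $\theta$ is
measured anticlockwise from the line joining $M$ to the vertex (1,0,0).
A direct calculation gives the co-ordinates of a general state as:

$$\begin{aligned}
\lambda_1 &= \tfrac13 + \sqrt{\tfrac23}\, r\cos\theta \\
\lambda_2 &= \tfrac13 + \sqrt{\tfrac23}\, r\cos\left(\theta + \tfrac{2\pi}{3}\right) \\
\lambda_3 &= \tfrac13 + \sqrt{\tfrac23}\, r\cos\left(\theta + \tfrac{4\pi}{3}\right)
\end{aligned} \tag{24}$$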
The constraint

$$\mathrm{Tr}\,\rho^2 = \boldsymbol{\lambda}\cdot\boldsymbol{\lambda} = \Lambda^2 = \text{const} \tag{25}$$

corresponds to the intersection of the simplex XYZ with a sphere of radius $\Lambda$ centred at (0,0,0).
This gives a circle or part of a circle (as shown in figure 2) within the simplex. In terms of polar
co-ordinates we get

$$\mathrm{Tr}\,\rho^2 = \frac13 + r^2 \tag{26}$$

In figure 2 the lines MX, MY and MZ divide the circle into six symmetrical parts corresponding
to the six possible permutations of the eigenvalues. Since we will be interested only in the
(unordered) set of values of the $\lambda$'s it suffices to consider just one of these regions. Thus without loss
of generality we may take

$$0 \le \theta \le \frac{\pi}{3}$$

For some values of constant $\mathrm{Tr}\,\rho^2$ or $r$, the angle $\theta$ will have a maximum value $\theta_{\max}$ smaller than
$\pi/3$ e.g. for the $r$ shown in figure 2 we take only the arc $\alpha\beta$. At the point $\beta$ one of the eigenvalues
has become zero.

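From eq. (26) we see that the constraint $\mathrm{Tr}\,\rho^2 = \text{const}$ corresponds to $r$ being constant. Using
eq. (24) we get:

$$\mathrm{Tr}\,\rho^3 = \sum_i \lambda_i^3 = \frac19 + r^2 + \frac{1}{\sqrt6}\, r^3\cos 3\theta$$

Since $\theta$ lies in the range $[0, \frac{\pi}{3}]$ we see that $\mathrm{Tr}\,\rho^3$ is a monotonically decreasing function of $\theta$.

Next we show that $S$ is also monotonically decreasing with $\theta$. To study the variation of entropy

$$S = -\sum_i \lambda_i\log\lambda_i$$

with $\theta$ when $\mathrm{Tr}\,\rho^2$ is held constant, note that $\sum_i \lambda_i = 1$ so we get

$$\sum_i \frac{\partial\lambda_i}{\partial\theta} = 0 \tag{27}$$

and so

$$\frac{\partial S}{\partial\theta} = -\sum_i \frac{\partial\lambda_i}{\partial\theta}\log\lambda_i$$

Since $0 \le \theta \le \frac{\pi}{3}$ we have from eq. (24)

$$\frac{\partial\lambda_1}{\partial\theta} \le 0, \qquad \frac{\partial\lambda_2}{\partial\theta} \le 0, \qquad \frac{\partial\lambda_3}{\partial\theta} \ge 0$$

Hence using eq. (27) we get

$$\frac{\partial S}{\partial\theta} = \left|\frac{\partial\lambda_1}{\partial\theta}\right|\log\frac{\lambda_1}{\lambda_3} + \left|\frac{\partial\lambda_2}{\partial\theta}\right|\log\frac{\lambda_2}{\lambda_3}$$

Using the inequality $\log x \le x - 1$ we get directly:

$$\frac{\partial S}{\partial\theta} \le 0$$

with equality possible only for $\theta = 0$ or $\pi/3$.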
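The step from the inequality $\log x \le x - 1$ to $\partial S/\partial\theta \le 0$ can be spelled out as follows. This is a sketch of the omitted calculation, assuming $\lambda_3 > 0$ and using the signs $\partial\lambda_1/\partial\theta \le 0$, $\partial\lambda_2/\partial\theta \le 0$ together with $\sum_i \partial\lambda_i/\partial\theta = 0$ and the constancy of $\mathrm{Tr}\,\rho^2 = \sum_i \lambda_i^2$ along the arc:

```latex
\begin{align*}
\frac{\partial S}{\partial\theta}
 &\le \left|\frac{\partial\lambda_1}{\partial\theta}\right|\left(\frac{\lambda_1}{\lambda_3}-1\right)
    + \left|\frac{\partial\lambda_2}{\partial\theta}\right|\left(\frac{\lambda_2}{\lambda_3}-1\right) \\
 &= -\frac{1}{\lambda_3}\left[\frac{\partial\lambda_1}{\partial\theta}(\lambda_1-\lambda_3)
    + \frac{\partial\lambda_2}{\partial\theta}(\lambda_2-\lambda_3)\right] \\
 &= -\frac{1}{\lambda_3}\sum_{i=1}^{3}\lambda_i\frac{\partial\lambda_i}{\partial\theta}
  = -\frac{1}{2\lambda_3}\frac{\partial}{\partial\theta}\,\mathrm{Tr}\,\rho^2 = 0 .
\end{align*}
```

The third line uses $\partial\lambda_1/\partial\theta + \partial\lambda_2/\partial\theta = -\partial\lambda_3/\partial\theta$ to absorb the $\lambda_3$ terms into the sum.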

To summarise, any given $\rho$ corresponds to a unique point $(r, \theta)$ with $0 \le \theta \le \pi/3$. If $\mathrm{Tr}\,\rho^2 =
\frac13 + r^2$ is held constant then both $\mathrm{Tr}\,\rho^3$ and $S$ are monotonically decreasing functions of $\theta$. Hence
$S$ is a monotonically increasing function of $\mathrm{Tr}\,\rho^3$ which completes the proof of Lemma A2. □
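The polar parametrisation and the two monotonicity statements can be checked on a grid of angles. In this sketch the radius `r = 0.25` is an arbitrary choice small enough for the whole arc $0 \le \theta \le \pi/3$ to lie inside the simplex, the entropy is computed with natural logarithms, and the function names are illustrative.

```python
import numpy as np

def eigenvalues(r, theta):
    """Point of the simplex at radius r and polar angle theta:
    lambda_i = 1/3 + sqrt(2/3) r cos(theta + 2 pi (i - 1) / 3)."""
    c = np.sqrt(2.0 / 3.0)
    shifts = np.array([0.0, 2 * np.pi / 3, 4 * np.pi / 3])
    return 1.0 / 3.0 + c * r * np.cos(theta + shifts)

def entropy(lam):
    """Shannon entropy (natural log) of a probability vector."""
    lam = lam[lam > 1e-15]
    return float(-np.sum(lam * np.log(lam)))

r = 0.25
thetas = np.linspace(0.0, np.pi / 3, 25)
lams = [eigenvalues(r, t) for t in thetas]
tr2 = [np.sum(l**2) for l in lams]   # expected: 1/3 + r^2 for every theta
tr3 = [np.sum(l**3) for l in lams]   # expected: 1/9 + r^2 + r^3 cos(3 theta) / sqrt(6)
ent = [entropy(l) for l in lams]     # expected: decreasing in theta
```

On this grid both `tr3` and `ent` decrease with $\theta$ while `tr2` stays fixed, so the entropy increases with the cubic trace at constant purity.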
Proof of Lemma A3: We have

$$\mathrm{Tr}\,\rho^3 = \mathrm{Tr}\,G^3 = \sum_{ijk} G_{ij}G_{jk}G_{ki} \tag{28}$$

Since $\rho$ is invariant under choices of phase representatives we may assume without loss of generality
that $G$ has the form given in eq. (~~). Then in eq. (28) the $\xi$ dependence arises only through three
terms (via cyclic permutation of the subscripts) of the form $G_{23}G_{31}G_{12} = p_1p_2p_3\,T_{123}$ and the
three corresponding complex conjugate terms $G_{32}G_{21}G_{13} = p_1p_2p_3\,\overline{T_{123}}$. This gives eq. (23). □
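The decomposition of the cubic trace into a constant plus a $\cos\xi$ term can be verified with an explicit triple sum. In this sketch the constant $C$ is isolated by evaluating the trace at $\xi = \pi/2$, where $\cos\xi = 0$; the Gram matrix convention $G_{ij} = \sqrt{p_ip_j}\langle\psi_i|\psi_j\rangle$ with $\langle\psi_2|\psi_3\rangle = a_{23}e^{i\xi}$ and all sample values are assumptions of the example.

```python
import numpy as np

def gram(probs, a12, a23, a31, xi):
    """G_ij = sqrt(p_i p_j) <psi_i|psi_j> with <psi_2|psi_3> = a23 exp(i xi)."""
    overlaps = np.array([
        [1.0, a12, a31],
        [a12, 1.0, a23 * np.exp(1j * xi)],
        [a31, a23 * np.exp(-1j * xi), 1.0],
    ])
    s = np.sqrt(np.asarray(probs, dtype=float))
    return overlaps * np.outer(s, s)

def trace_cubed(g):
    """Explicit triple sum over G_ij G_jk G_ki."""
    return np.einsum("ij,jk,ki->", g, g, g).real

p = [0.5, 0.3, 0.2]
a = (0.5, 0.6, 0.4)
c_const = trace_cubed(gram(p, *a, np.pi / 2))   # cos(xi) = 0 isolates C
xi = 0.7
lhs = trace_cubed(gram(p, *a, xi))
rhs = c_const + 6 * np.prod(p) * np.prod(a) * np.cos(xi)
```

The values `lhs` and `rhs` agree to rounding error: three cyclic terms and their three complex conjugates together contribute six times $p_1p_2p_3\,a_{12}a_{23}a_{31}\cos\xi$.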